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ABSTRACT 

Consider the unstructured search of an unknown number I of items in a large unsorted 
database of size A^. The muhi-object quantum search algorithm consists of two parts. The 
first part of the algorithm is to generalize Grover's single-object search algorithm to the 
multi-object case (0, |, ||, 0) and the second part is to solve a counting problem to 
determine / ([^ 0)- this paper, we study the multi-object quantum search algorithm 
(in continuous time), but in a more structured way by taking into account the availability 
of partial information. The modeling of available partial information is done simply by 
the combination of several prescribed, possibly overlapping, information sets with varying 
weights to signify the reliability of each set. The associated statistics is estimated and the 
algorithm efficiency and complexity are analyzed. 

Our analysis shows that the search algorithm described here may not be more efficient 
than the unstructured (generalized) multi-object Grover search if there is "misplaced con- 
fidence". However, if the information sets have a "basic confidence" property in the sense 
that each information set contains at least one search item, then a quadratic speedup holds 
on a much smaller data space, which further expedites the quantum search for the first item. 
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1 Introduction 



Grover's quantum search algorithm, since its first pubhcation in 1996 (0), has become 
one of the most prominent algorithms in quantum computation. Its elegance has drawn the 
attention of numerous computer scientists, mathematicians and physicists, resulting in many 
research papers on this subject. Grover's original work P, [ill] dealt with a single-object 



search in a large unsorted database. He shows that his quantum algorithm has a quadratic 
speedup. Farhi and Gutmann p presents a continuous time, or "analog analogue" version, 
of Grover's algorithm and obtains a similar complexity. 

In practice, most of the search tasks consist of finding more than one item in a large 
database. Therefore the development of multi-object search algorithms is important. By 
utilizing the two most important ingredients in Grover's algorithm, namely, 

(i) the notion of amplitude amplification; and 

(ii) the dramatic reduction to invariant subspaces of low dimension for the unitary opera- 
tors involved, 

it is possible to generalize the algorithm to multi-object search. See the discrete-time case in 
Boyer, Brassard, H0yer and Tapp , and the continuous-time case in Chen, Fulling and Chen 
0. However, for multi-object problems, the number of search items is normally not given a 
priori and, therefore, its determination is crucial. This becomes a quantum counting problem. 
The problem was partly treated in Brassard, H0yer and Tapp but a complete solution 
did not seen to appear until Mosca's Ph.D. Thesis |]14| in 1999. The counting problem can 



be studied with the techniques of "eigenvalue kickback", phase/ amplitude estimations and 
quantum Fourier transforms (QFT). 

Excluding the computational complexity of the counting problem, the generalized, un- 
structured Grover multi-object search of / items in a database of items has computational 
complexity 0{^/N/l) versus the classical Q{N/{1 + 1)) (|jl^, p. 70]). So again we see a 
quadratic speedup. This is significant. Nevertheless, pragmatically, one usually can (and 
should) do much better than this because in most realistic search tasks there is additionally 
given partial information about the search targets, provided that one knows how to utilize 
such information. 

The mathematical modeling of the availability of partial information is challenging work. 
Obviously, there are varied situations of how such information can be given and how it can 
be encoded into the computer. Therefore, mathematical expressions intended to model those 
situations may be qualitatively different. This difficulty is further compounded by the fact 
that no quantum computers (QC) have been built and are currently in operation so far, as 
solutions to the modeling problem hinges very much on the addressing, retrieval and data 
structure designs of the future QC. At present, we do not yet know how to categorize all (or 
most) of the possible situations that may naturally arise, but we are continuing to probe in 
this direction to improve our understanding on this modeling aspect. Our work here, though 
rather simplistic in nature, hopefully could serve as a modest start to draw more research 
interest in the directions of structured search in the future. 

Consider the following hypothetical situation: 
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"Professor John Smith, an outdoors buff, goes to the hbarary. He 

requests the hbrarian to assist him to find the total number and 

the titles of the books published between 1/15/1990 and 6/15/1990 

on the subjects of hunting, fishing or hiking". (1.1) 

His search targets arc precisely given as follows: 

T = {book title x\x is published between 1/15/1990 and 6/15/1990, 

X is on hunting, fishing or hiking}. (1-2) 

The number of items in T is not known in advance; therefore, it involves a counting 
problem as well. A brute force multi-object (generalized) Grover search would proceed to 
find items in T among all books in the library's holding, denoted as A. This would require the 
crude 0{^jN/l) quantum complexity if T has cardinahty / and the library's book holding 
A has cardinality N. This would be inefficient. However, (most) libraries group books 
according to subject interests. Instead of searching T among A, we should search T among 

U ^2 U A3, where Ai, A2 and A3 denote, respectively, the set of book titles on hunting, 
fishing and hiking. This is intuitively clear to surely cut down search time even without 
mathematical justifications first. See (12) in §3. 

We call such sets Ai, A2 and A^ here (partial) information sets. These sets may not be 
disjoint from each other, such as example (1.1) here amply illustrates the fact that there 
are many books dealing with both hunting and fishing and, thus, they belong to Ai (1 A2. 
Inside a computer (whether quantum or electronic), each of such datasets like A^, i = 1,2,3, 
here occupies a block of memory space, with additional ordered/sorted data structure. For 
example, the dataset Ai containing all book titles on hunting may already be either sorted 
according to the alphabetical orders of authors' names or the chronological orders of time of 
publication, or both. Such ordered data structures are likely to even expedite search with 
possible exponential speedup; nevertheless, we will not consider or exploit any sorted data 
structure for the time being in this paper. 

Generally, for a given collection of information sets Ai, i — 1,2, .. . ,n, such that T C 
Ai U A2 U . . . U An, there is in addition a given probability distribution that weighs some 
sets Aj more heavily than the others, depending on the reliability or preferences of the 
information source. For example, in (1.1), if Professor Smith has indicated that fishing is 
his primary sporting interest, then his information set A2 ought to weigh heavier than Ai or 
As in his case. 

Now having offered the physical motivations in our study of the modeling of search with 
the availability of partial information, we proceed to treat the multi-object search problem 
related to an analogue QC design. 

2 Mult i- Object Search with the Availabihty of Partial 
Information on an Analogue Quantum Computer 

Let a large database consist of N unsorted objects {wj\l < j < N} = A, where is an 
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extremely large integer. Let T = {wj\l < j < 1} C Ahe the target set of search objects, 
where I is an unknown integer. The information about T is given as follows: 

(1) There is an oracle (or Boolean) function satisfying 

N f 1, J = 1,2,... .1 ,^ 

This function acts in the black box of QC and can be known only through queries. 

(2) There are n explicitly given information (sub)sets Aj, j — 1,2, ... ,n, such that 

= = 1,2, . . . ,kj} G A 

and 

r C Ai U ^2 U . . . U A„ (2.2) 

(3) There is a given probability distribution that assigns different weights to various subsets 
Aj, depending on the reliability or (searcher's) preference of that information set. Let 
such weights be called reliability coefficients and denoted as 



n 



{Q;, >0|i = l,2,... ,n,^a, = l} (2.3) 

In the QC, each object Wj G A is stored as an eigenstate \wj) which collectively form 
an orthonormal basis B = {|wi)b = 1,2,... ,N} of an iV-dimensional Hilbert space 7i. 
Let us denote C — span{\wj)\j = 1, 2, . . . , /} as the subspace containing all the eigenstates 
representing the search targets. Suppose we are given a Hamiltonian H in Ti and we are 
told that H has an eigenvalue E ^ on the entire subspace C and all the other eigenvalues 
are zero. The search task is to find an eigenstate \wj) in C that has eigenvalue E. The task 
for the first search item is regarded as complete when a measurement of the system shows 
that it is in a state \wj) G C 

The analogue quantum computer for implementing multi-object Grover's search is a 
quantum process modeled by the Schrodinger equation 



t-\m) = Hm)), t>o, (2.4) 



d 

~dt 

\m) 

where H, the overall Hamiltonian, is given by 

H^H + Hd, (2.5) 

where 

I 

H = Ej2\^j){wj\ (2.6) 
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is the Hamiltonian satisfying the aforementioned property that it has an eigenvalue E on C, 
with the rest of its eigenvalues being zero. Note that 



E ^ 



therefore the knowledge of / alone determines H; no knowledge of < j < 1} is 

required or utilized since it is assumed to be hidden in the oracle (black box). 

In (2.5), Hd is the "driving Hamiltonian"; its choice is up to the algorithm designer. 

Remark 2.1. Without the assumption (2.2) and (2.3), a "good" driving Hamiltonian to 
choose (0, §) is 

Hd = E\s){s\ (2.7) 
related to the initial state |s), where \s) is further chosen to be 

N 



the uniform superposition of all eigenstates. 

For the discrete-time case (@, generalized Grover "search engine" is chosen to be 

U = -Ish, (2.9) 

where 

lL = I-^H,h=I-2\s){sl (2.10) 
/ = the identity operator on the Hilbert space H. 

□ 

Since now we have the extra properties (2.2) and (2.3) at hand, based on the insights we 
have gained from the analysis of Grover's algorithm, it is not difficult to see that searching 
by using the initial state (2.8) is not necessary, because the useful component, namely, the 
projection of |s) in C, is too small compared with the component of \s) outside C: 

\\Pc{\s))f/\\PM\s))T = l/{N-l), 

where Pc is the orthogonal projection operator onto the subspace C, is the orthogonal 
complement of L, and ||.|| is the norm of Ti. 

Because of (2.2) and (2.3), instead of (2.8) it is now natural for us to choose 



j=i i=i 



(2.11) 
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where z/ is a normalization constant. From (2.11), we rearrange terms and simplify, obtaining 



e+R 



(2.12) 



i=l 



i=e+i 



where the first sum on the RHS above is composed of all the terms in £, and the second 
sum consists of the remaining R terms in 

Remark 2.2. With the choice of a different |s) as in (2.12), the state equation (2.4) now has 
a new initial condition which is different from the uniform superposition of all eigenstates 
given in (2.8). Biron, Biham, et al. |jl|, call this the choice of "arbitrary initial amplitude 
distribution" in their paper. The papers [|T], 0] have shown certain advantages of the choice 
of general amplitudes in the discrete time case even though their ideas are unrelated to our 
problem under treatment here. □ 

Theorem 2.1. Consider the Schrddinger equation 



ij^im) = H\m) = {H+ Hn)\m), t > o. 



(2.13) 



1^(0)) = \s), 

where H and Ho are given, respectively, by (2.6) and (2.7), and \s) is given by (2.12). Then 

(1) H and the evolution operator e'"^^^ have an invariant two-dimensional subspace 
V = span{\w), \r)}, with 



y 



(e \ ^/'^ e £+R 

Y:\p.n <i. H^iEftw. w-Tf^E' 
i=i / ^1=1 V ^ ~ y i=e+i 



PM), (2.14) 



On V, H and e admit 2x2 matrix representations 
H = E 



^-iHt ^ ^-iEt 



1 + y"^ y^l-y'^ 

cos{Eyt) — iy sm{Eyt) — a/1 — y'^i sm{Eyt) 
— a/1 — y'^i sm{Eyt) cos{Eyt) + iy sm{Eyt) 



(2) The state il){t) is given by 



(2.15) 
(2.16) 



^lj{t) = e-'"^\s) = e-'^^{[y cos{Eyt) - ism{Eyt)]\w) + y^l - y^ cos{Eyt)\r)},t > 0. 

(2.17) 

Proof: From (2.12) and (2.14), we have 



\s) = y\w) + Vl - i/2|r); 



(2.1^ 
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so 



\s){s\ = y'^\w){w\ + y^/l — y^{\w){r\ + + (1 — y'^)\r){r\. 



Also, note that 



For any vector v e V, we may use the spinor notation 

V = a\w) + b\r) = [a b]^; a, 6 e C 

Thus, 

Hv = {H + E\s){s\)v 

= E(^Pc+[y^\w){w\+yy/l-y^{\w){r\ + \r){w\) + (1 - y^)|r)(r|]) (a|^D) + 6|r)) 
= (a|tt;) + ay'^\w) + ay\/l — y'^\r)) + {hy^Jl — y'^\w) + 6(1 - y^)|r) 

e V. 



1 + vV^-y^ 





a 







(2.19) 



Obviously, H is invertible on V. Therefore, H(y) = V, and H has the 2x2 matrix repre- 
sentation (2.15) on V according to (2.19). From (2.15), we calculate the exponential matrix 
e-'^^ to obtain (2.16). 

The solution (2.17) for the state equation (2.13) follows from (2.17) and (2.18). □ 



2Ey' 



we 



Corollciry 2.2. Assume the same conditions as Theorem 2.1. Then at time T 
have \'il){T)) e C Consequently, after measurement it yields a first search item wj e T with 
probability for j — 1,2, . . . ,1, and total probability 1. 



Proof: Obvious from (2.17). 



□ 



Theorem 2.3. Assume the same conditions as Theorem 2.1. Define the following two vec- 
tors in V: 



(2.20) 



Then 



(i) Xi and X2 are the unique orthonormal eigenvectors of H on V, i.e., (2.15), corre- 
sponding, respectively, to eigenvalues Ai — E{1 -\- y) and \2 — E{1 — y); 



(ii) For each t > 0, the evolutionary operator e satisfies 



(2.21) 
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Proof: Straightforward calculations and verification. □ 
Even though Cor. 2.2 gives the informed answer that the quantum search process should 
be measured at time T = 7r/{2Ey) in order to obtain the first desired object, the trouble is 
that we do not know explicitly what the value of y is in order to determine T. Now Thm. 
2.3 affords the information that Xi and X2 are eigenvectors of H of e~*^*. We can apply the 
"eigenvalue kickback" and "phase estimation" techniques, first developed by Kitaev |T^, to 
estimate the crucial value of y. The quantum Fourier transforms (QFT) plays a central role 
in this approach; see a lucid introduction in Mosca |n]. 



Let us construct a unitary operator Q = e ^^(W'E)^ Then from (2.21) and (2.18), we 
have 



QXi = e-*2-2/Xi, QX2 = e'^^yX2, (2.22) 



Q^\s) = Q"^{y\w) + /r^|r)) = ( ^/^^i + / 



}_+y^-i2mnyj^^ + ^ i^e^^^-^^Xa, for m = 0, 1, 2, . . . . (2.23) 

Thus we see that y appears as a phase factor in (2.22) and (2.23). Further, y also appears 
in the amplitudes on the RHS of (2.23). 

We add an ancilla register \m), m = 0,1,2, .. . , M — 1, for a sufficiently large integer M 
and form 

M-l r A/-1 /—— — M-1 

m=0 m=0 m=0 

(2.24) 

For any given a; = 0, 1, . . . , M — 1, define QFTs J-'m and J^^l by 

^ M-l M-l 

For any u; G M, define 

l^)=^M (2-25) 
Applying to the first register in (2.24), we obtain 



1^2) = y^\y) ® ^2 + ^J^\~y) ® X,. (2.26) 

Now, measurement of the first register on the RHS of (2.26) will yield the state \y) or \1 — y) 
with probability and respectively. The state \y) or \1 — y) further collapses to one 
of the eigenstates j = 0,1,2, .. . , M — 1, of the first register. 
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Theorem 2.4. Assume the same conditions as Theorem 2.1. Let us measure the first 
register of {^2) on the RHS of (2.26), which collapses to one of the eigenstates j — 
0, 1, 2, . . . ,M — 1, of the first register. Then 



l-y 

with probability — - — ,V{\j — My\ < 1 



(a) with probability 



2 

1 + y 



^V{\j-M{l-y)\<l\\l-y))>- 



(2.27) 

(2.28) 



where 7^(^415) denotes the probability of an event A conditioned on the event B. 
Proof: First, note from the definition (2.25) that 



M-l 



M-l 



M-l 



M 
M-l 



k=0 1=0 



-i2kTTj/M 



\J) 



= J2 My)\k), 



k=0 



where 



oik{y) 



M-l 



M 



E 

i=o 



The probabihty that we wiU obtain \y) is (1 — y)/2. The measurement of \y) wiU then yield 
an eigenstate \k) with probabihty \ak{y)\'^. Our task now is to estimate akiy): 

M-l M-l 



p=Q j=Q 



M-l 



i27rj(y-^) 



j=0 



1 

M 

12 ; 



1 



1 

M 



sm{Tv{My k)) 



sin(7r(y - ^)) 



(2.29) 



We see in the above that |Q;fc(|/)| is maximized ii y = k/M, yielding |Q;fc(|/)| = 1, i.e., 



V{\k) happens \\y)) — 1. Thus, the above provides a way of measuring y in terms of M and 
k. 

In general, y is a real number. Therefore, we cannot expect the certainty V^k) happens \\y)) 
1 no matter how M is chosen. To treat the case 7/ e R, we first define, for any r e M, 

[r\ = the largest integer smaller than r, 
[r] = the smallest integer larger than r. 



For fixed M, denote A = Mv^^ = y _ i^. Then ^ - A 



[My] -My 
M 



Therefore, from (2.29), 

V{\My-k\<l\\y))=V{[My\ = k\\y)) + V{\My] = k\\y)) 

_ sin'(MA7r) sin'(M(^ - A)7r) _ 
~ M2 sin2(A7r) ^ sm''{jj - A)tt) ' 



IMA _ „ 

M y- 
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the RHS above attains minimum at A = grj, giving 



r{\My-k\<l\\y)) 



2M' 
1 . 1 



+ 



M^sin^i^) sin 



> 



,2( TT ' 

2 



Therefore (2.27) has been proven. 

The second possibihty is that, from (2.26), we obtain \1 — y) with probabihty 



i+s/. 



further collapses to \k') such that 

V{\k' -M{l-y)\ < l\\~y)) 



2 ' 



i-y) 



sin2(MA7r) sm'^{M{j^ - A)7r) 8 



> — . 



M2 sin'(A7r) sm\j, - A)7r) " tt 



where A = 



M{l-y)-[A4{l-y)] 
M 



□ 



Remark 2.3. (i) The quantum search procedures as culminated in (2.26) is hybrid in the 
sense that it operates concurrently on continuous (i.e., t ) and discrete (i.e., m in QFT) 
variables (Lloyd p!3|). 

(ii) In QC implementation, (assume that) qubits are used and, thus, M = 2" for some 
positive integer n. The circuit for estimating y from the ancilla register \m) (cf. (2.23)- 
(2.26)) may be found in Fig. 1. □ 



lm> ^2 



Is> 



QFT 



measure 



Qls> 



Fig. 1 Circuit for estimating y in ( p.26|) , where Xo,Xi,... ,Xn_i represent the 

ascending order of qubits and M = 2". 

From (2.29), we see that in the estimation of what matters is \sin{Tx{y — -g))| and, 
consequently, the relevant distance between our estimate k/M and y itself is not simply 
\y ~ j^]- A better measurement of distance is given as follows. 

Definition 2.1 ( ||14| , p. 45]). The distance d{yi,y2) between two real numbers yi and y2 
is the real number 



d{yi, 2/2) 



min 2/1 - 2/2 + J 
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i.e., d{yi,y2) makes the shortest arclength on the unit circle between e*^'^^^ and e*^'^^^ be 
2TTd{yi,y2). □ 

Corollary 2.5. Assume the same conditions as those in Thms. 2.1 and 2.4- Measurement 
of the first register of |\l/2) on the RHS of (2.26) will yield the state \k) such that 

(i) if My is an integer, then 'P{\k) happens ) = 1; 

(a) if My is not an integer, then 



V{\k) happens \y)) = \ ^'^[ < (2.30) 

sm^Mndjl-y,^)) 
M2sin2(vrrf(l-y,A)) " (2Mrf(l - 



V{\k) happens \\l-y))= . ' < — — (2.31) 



(Hi) 



V{d{l-y,^) < ^\\l-y)) > 4: 



(iv) for m > 1, 



ndiy4)<^\\y))>i-^^^y (2,32) 

^''"^-^•s'^sli~^>'^^-2(;ib)- <2-^3) 

Proof: Many estimates are already clear from the proofs given previously. The rest can 
be established using pp. 45-46] as follows. 
It is clear from (2.29) that 

V{\k) happens \\y)) = \ T.V 2.34 



Using the fact that 2x < sinvra; < irx for x G [0, 1/2], from (2.34) we obtain 

P(|fc> happens IIS)) <-ij_2^, 

which proves (2.30). We can similarly prove (2.31). 
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To show (2.32), we have 

-)< 



nd{y, 4) < ?-A\y)) = n\My -k\< m\\y)) 

= l-V{\My -k\> m\\y)) 

M 



>l-Y,^{\My-k\=j\ 

j=m 

oo 

>l-J2Vi\My-k\^j\\y)) 

j=m 
oo 

>i-2y — j^>i- , ^ 

,t;;.4M2(X)2- 2(m-l) 
The estimate (2.33) also follows similarly. □ 



3 Efficiency and Complexity 

Let us address various relevant issues in this section. 

(II) Will the search algorithm with the availability of partial information given 
in §2 always be more efficient than the unstructured Grover multi-object seeirch 
algorithm? 

The answer is NO. A simple counterexample is sufficient to demonstrate this point. Let 

TCA1UA2, TCAi, r n ^2 = 0; Ai, A2 C A. (3.1) 

Assume that the cardinalities of, respectively, T,A,Ai, and A2, are /, A^, ni and n2. Let 
the reUability coefficients be {Q;i,a2}, where q;i,q;2 > 0, q;i + q;2 = 1. Then by (3.1), we 
easily see that 

A = ai/z/,z = l,2,... ,/; (cf. (2.11), (2.12)) 

= [(ni-ni2)af + nl2(al + a2)^ + (r^2-r^l2)Q!i]^/^ ) (3.2) 



V 



ni2 = the cardinality of Aif] A2. 



Thus 



y = 



j=i 



and by Cor. 2.2, the time T required to reach C is 



T=^ = ^^. (3.4) 



If q;2 is very close to 1, then it is easy to see from (3.2)-(3.4) that 

lim r = oo. 

a2—*l~ 
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Therefore this algorithm is not efficient when 0:2 is close to 1. (Conversely, if 0:2 is close 
to 0"*", then we see that the algorithm will be efficient.) 

It is obvious to see what causes the trouble. In (3.1), we see that the information set A2 
is irrelevant to the search target set T(i.e., T n A2) but too heavy weight a2 is assigned to 
the set A2. This is a situation with misplaced confidence on the set A2. It is definitely to be 
avoided. The opposite situation of which is called by us one with basic confidence. 

Definition 3.1. Consider (2.2). If fl T 7^ for j = 1,2, . . . ,n, then we say that we have 
basic confidence in the partial information sets Ai, A2, . . . , An- □ 

(12) Will the search algorithm in §2, with the additional assumption of basic 
confidence, be more efficient than the unstructured Grover multi-object seeirch 
algorithm? 

The answer is YES. The following theorem shows that we still maintain a quadratic 
speedup of Grover. 

Theorem 3.1. Consider (2.2) and assume that we have basic confidence. Then we have 

1 \ 1/2 



vi=l 



where I + R is the totality of distinct elements in Ai U . . . An. Consequently, the time T 
required for \ip{T)) to reach L is 

Proof: Comparing (2.11) and (2.12), we have, for each j = 1, . . . , /, 



where 



ai if \wj) G Ai, 



^^'^ 1 otherwise. 



Therefore, 



I I / n \ 2 -i 

j=i 3=1 \i=i J i=i 

by the assumption that we have basic confidence and the inequality (a + 6)^ > + if both 
a and b are positive. Also, 

n 
1=1 
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under the constraint that Y17=i'^i — 1- This follows from the Lagrange multiplier method 
(or the Cauchy-Schwarz inequality). 

From (2.11), the normalization constant u takes minimal value where the sets Ai, . . . , An 
are in totality orthonormal, and takes maximal value when it happens that Ai = A2 — . . . — 

= {\wj) |i = 1, 2, . . . , / + R}. Thus 

< i/' < Z + i?. (3.9) 

j=i i=i 

By (3.8) and (3.9), we obtain 

1=1 ^ ' 

and hence (3.6). □ 

Corollary 3.2. Assume basic confidence. If I + R — 0{N^) for some small 5 > 0, then the 

search task for the first item will be completed in time duration T — 0{n^^'^N^^^), where n 
is the cardinality of the set {Ai, . . . , A^}. □ 

Normally, if the partial information sets are very descriptive in the sense that / + 7? is 
small, say, I + R = 0{N^) with 5 <^ 1, then the search algorithm in §2 will be more efficient 
than the unstructured Grover search. 

Remark 3.1. (i) The estimate (3.6) is obtained under the possibility that Ai = A2 = 
. . . = An = {\wj)\j = 1, 2, . . . , / + R}, which is a rare and trivial happenstance (that 
all information sets coincide). The other extreme is that there is no overlapping at all 
between the information sets, i.e., A^ fl = for any i,j e {1, 2, . . . , n}, i ^ j. Then 
under the assumption of basic confidence the conclusion in Cor. 3.2 still maintains its 
order of optimality . See (ii) and Cor. 3.3 below. 

(ii) By observing (2.11) and (2.12), we see that for the example (1.1) and (1.2), any wj^^ G T 
such that Wjg E Ai (1 A2 Cl A3 will have a larger weight because is repeated in 
all Ai, A2 and A^. As a consequence, this Wj^ is likely to be the outcome as the search 
of the first item. This means that a book title including all the interests in hunting, 
fishing and hiking is more likely to turn up than the other titles as the outcome of 
search. This can be undesirable, however. The only way to avoid this from happening 
is to eliminate the repetitions (or overlappings) between all Ai,A2 and A3 (and, in 
general, between all Ai, A2, . . . , An). Indeed, under the assumption that A^ n Aj = 
for all i, J e {1, . . . , n} and i 7^ j, we have (from (2.11)) 

n kj 

= ^ ^ Q;| = kxQ({ + k2Q(i + . . . + knO?n 
j=l i=l 

<{ki + k2 + ... + kn){al + al + ... + al) 

n 

= (^ + i?)^a^ (3.10) 

i=l 
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Using (3.10) in (3.7), we obtain 



2 1 



l + R 
and hence 



□ 



Corollciry 3.3. Assume basic confidence and that A^nAj — for all i, j & {1,2, . . . , n}, i ^ 
j. Then 

Consequently, if I + R = 0{N^), then the search task for the first item will be completed in 
time duration T — 0{N^/'^^ independent of n. □ 

(13) Can we determine the cardinality of T, using the algorithm in §2? 

The answer is NO, unless we do extra work. In general, because the choice of reliability 
coefficients {aj}^^i is somewhat arbitrary, the cardinality / of T will not be manifested in 
y. Even if we choose uniform weights = l/n,i = 1,2, .. . ,n, for the information sets 
Ai, . . . ,An, we still are unable to estimate y (or 1 — y) because elements in T may have 
repeated appearances in Ai, A2, . . . , When all the Aj's are disjoint, then 



y = 



I 



l + R 



One can thus estimate I and R based on y and 1 — y from Cor. 2.5, as it is usually done in 
solving the quantum counting problem. 

Because the information sets ^1,^42,... , An generally have some overlapping, we need to 
eliminate such overlapping first through some processing in order to do counting. 

(14) The choice of different information sets 

The example stated in (1.1) so far has been treated by choosing the information sets Ai, 
A2 and A^ as denoted in the paragraph following (1.2). Instead, one can choose just a single 
information set 

Aq = {book title x^x is published between 1/15/1990 and 6/15/1990}. 

Then the search of T will be carried out in Aq. As we expect the cardinality of Aq will 
be much larger than the sum of the cardinalities of Ai,A2 and A^, this search will be less 
efficient. 

The choice of information sets seems to rely on the human operator as well as on how 
the data are encoded. 

(15) The work involved 

For each estimation of y, we need 0((logM)^) number of operations. With each (es- 
timated) value of y, we require time duration T = in order to obtain the first search 
item. 
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